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Abstract 

pH We consider multi-input multi-output (MIMO) communications over multi-mode fibers (MMFs). Current MMF 

l-H 

• standards, such as OM3 and OM4, use fibers with core radii of 50 /im, allowing hundreds of modes to propagate. 

O Unfortunately, due to physical and computational complexity limitations, we cannot couple and detect hundreds of 
data streams into and out of the fiber. In order to circumvent this issue, two solutions were presented in the literature. 

►^^ The first is to design new fibers with smaller radii so that they can support a desired number of modes. The second 

^S) is to design multi-core fibers with a reasonable number of cores. However, both approaches are expensive as they 

^ I necessitate the replacement of currently installed fibers. Moreover, both approaches have limited future scalability. In 

(*^ our work, we present input-output coupling schemes that allow the user to couple and extract a reasonable number of 

■^^ signals from a fiber with many modes (such as those used in 0M3 and OM4 standards). This approach is particularly 
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attractive as it is scalable; i.e., the fibers do not have to be replaced every time the number of transmitters or receivers 
is increased, a phenomenon that is likely to happen in the near future. In addition, fibers with large radii can support 
higher peak powers relative to fibers with small radii while still operating in the linear regime. However, the only 
concern is that fibers with more modes suffer from increased mode-dependent losses (MDLs). Our work addresses 



?— j this last concern. 



We present a statistical channel model that incorporates intermodal dispersion, chromatic dispersion, mode 
dependent losses, and mode coupling. We show that the statistics of the fiber's frequency response are independent of 
frequency. This simplifies the computation of the average Shannon capacity of the fiber. We later extend this model 
to include input and output couplers and provide an input-output coupling strategy that leads to an increase in the 
overall capacity. This strategy can be used whenever channel state information (CSI) is available at the transmitter and 
the designer has full control over the couplers. We show that the capacity of an Nt x Nt MIMO system over a fiber 
with M ^ Nt modes can approach the capacity of an A'^t-mode fiber with no mode-dependent losses. Moreover, we 
present a statistical input-output coupling model in order to quantify the loss in capacity when CSI is not available 
at the transmitter or there is no control over the input-output coupler. It turns out that the loss, relative to A^t-mode 
fibers, is minimal (less than 0.5 dB) for a wide range of signal-to-noise ratios (SNRs) and a reasonable range of 
MDLs. This means that there is no real need to replace the already installed fibers and that our strategy is an attractive 



approach to solving the above problem. 

I. Introduction 

Since Shannon defined the notion of channel capacity as the fundamental limit on achievable transmission rates 
with vanishing probability of error, system designers have attempted to reach this limit by leveraging device tech- 
nology advances and increasingly sophisticated algorithms and architectures. Moore's law, together with advances 
in signal processing, information theory, and coding theory have enabled us to essentially achieve this fundamental 
limit for a number of narrow-band wired and wireless communication links. Because of their superior bandwidth- 
distance product, optical fibers have become extremely popular and have largely replaced traditional copper wire 
technologies. Optical communication links can support serial data rates that are typically several orders of magnitude 
higher than their wired or wireless electrical counterparts, such as voice-band or cable modem technology or even 
high-speed chip-to-chip serial links. Despite their superiority, optical links have limited capacity and the circuits, 
signal processing, and information theory communities need to completely re-think the design and analysis of 
optical communication systems in order to address the ever increasing demand for Internet bandwidth. Multi-input 
multi-output (MIMO) communications over multi-mode fibers (MMFs) holds the promise of improving bandwidth 
efficiency. However, the capacity of MIMO optical links has not been fully investigated due to the lack of accurate 
and mathematically tractable channel models. In this paper, we present a detailed linear model for the MIMO 
multi-mode optical channel and analyze its capacity as a function of input-output coupling as well as other physical 
parameters. We also introduce an input-output coupling strategy and compare it to the uncontrolled coupling case 
in terms of achievable rate. 

A. Motivation 

In an information-intensive era, the demand for Internet bandwidth is increasing at a rate of 56% per year, while 
the increase in supply is falling behind at a rate of 25% per year |1|. The increase in demand is fueled by the 
boom in web-based data services such as cloud computing and real-time multimedia applications. As a result, 
optical fiber communication researchers are looking into new ways of boosting the transmission rate of optical 
links. Given that polarization division multiplexing (PDM) and wavelength division multiplexing (WDM) have 
already been exploited Q, the only remaining degree of freedom is space division multiplexing [S)- MIMO optical 
communication increases the transmission rates of MMF systems by multiplexing a number of independent data 
streams on different spatial modes. Note that, unlike WDM systems, all the laser sources in this case have the same 
wavelength. MMF is a dominant type of fiber used for high speed data communication in short-range links such as 
local area networks (LAN) and data centers ||4l. It is usually favored over single-mode fibers because of its relaxed 
connector alignment tolerances and its reduced transceiver connector costs. Plastic optical fibers are great examples 
of MMFs with remarkably low installation and operation costs (|5l. However, they suffer from mode-dependent 
losses, mode coupling, intermodal dispersion, and chromatic dispersion (group velocity dispersion) ||6l. All these 



phenomena will be explained in detail in Section II-A These limitations make the design and analysis of MIMO 
multi-mode systems challenging yet exciting. 

B. Literature Review 

Following the work of Shannon Q, many information theorists investigated the capacity of different channels, 
including single-input single-output (SISO) channels with memory, channels with constrained input alphabet, and 
multiple-input multiple-output (MIMO) channels. In their seminal works, Telatar and Foschini et al., independently 
showed that the capacity of a MIMO flat fading wireless channel, under the Raleigh fading model, scales linearly 
with respect to the minimum number of antennas at the transmitter and receiver JH, lID. Since then, the wireless 
communications community has been focused on developing detection and coding schemes for MIMO systems in 
order to achieve the aforementioned capacity gains. Recent wireless technologies such as WLAN 802.1 In and Long 
Term Evolution Advanced (LTE-A) are examples of MIMO systems deploying up to 8 transmitters and receivers. 
More importantly, this MIMO technique is not limited to wireless systems. 

Loosely speaking, the number of degrees of freedom (DoF) of a channel is an upper limit on the number of 
independent data streams that can be transmitted through the channel over a period of time. A more rigorous 
definition of DoF is given in |10| as the minimial dimension of the received signal space. The quadrature and in- 
phase components of a passband information signal are two familiar and commonly exploited degrees of freedom in 
wired and wireless communication systems. Frequency, time, code, quadrature, and polarization states are all well 
explored and already utiUzed in commercial optical systems. However, the spatial degree of freedom, which is unique 
to MMFs, has not been exploited yet in commercial products and is still under research. In 2000, H. R. Stuart was the 
first to notice the similarity between the multipath wireless channel and the MMF optical channel and suggested 
using the spatial modes to multiplex several independent data streams onto the fiber ifTTI . Prior to this finding, 
single-mode fibers were always considered to be superior to MMFs because of their improved bandwidth-distance 
product (as SMFs do not suffer from intermodal dispersion). However, we will show in Section [Hi] that MMFs 
have advantages over single-mode fibers from an information theoretic capacity perspective. Therefore, MIMO over 
MMF seems to be a better route to higher data rates. In fact, Stuart was the first to demonstrate the feasibility 
of a 2 X 2 MMF system and to show that there are indeed some capacity gains to be leveraged ifTTI . However, 
Stuart's analysis and experiments assumed a radio frequency sub-carrier (~ 1 GHz) instead of an optical carrier 
(^ 100 THz). This assumption was later relaxed in the work of Shah et al. but their treatment did not account 
for any intermodal dispersion, chromatic dispersion, or mode coupling llT2ll . Recently, the information theoretic 
capacity of coherent MMF systems has been studied in O, where the authors ignored the frequency selectivity of 
the channel but incorporated the effects of mode coupling. In |fT3l . Keang-Po et al. considered the capacity of a 
frequency selective MMF channel at a particular frequency. They later studied the impact of frequency diversity 
on the channel capacity for mutli-mode fibers with 10 modes fM]. However, their models did not incorporate the 
effect of mode-dependent phase shifts or chromatic dispersion. 



C. Outline and Contributions 

In Section III] we present a MIMO channel propagation model that takes intermodal dispersion, chromatic 



dispersion, mode-dependent losses, and mode coupling into account. In Section III we compute the Shannon 



capacity of an M-mode fiber and demonstrate how mode-dependent losses and mode coupling affect it. In Section 



IV we analyze the coupling of a reasonable number of laser sources to a fiber with hundreds of modes. We also 
propose an input-output coupling model and present a coupling strategy: using the input-output couplers to perform 
a particular type of beamforming. This strategy allows the effective transmission of data along the least lossy subset 
of end-to-end eigenmodes. The resultant capacity is almost equal to that of a fiber with Nt modes and no modal 
losses, an ideal case which maximizes the capacity of an Nt x Nt MIMO system. This coupling strategy can 
only be used when channel state information (CSI) is available at the transmitter and there is full control over the 
input-output couplers. In the absence of these conditions, an appropriate random input-output coupling model is 
used in order to better model the behavior of the system and quantify the expected loss in the fiber's capacity. It 
turns out that the loss, relative to A^t-mode fibers, is minimal (less than 0.5 dB) for a wide range of SNRs and a 
reasonable range of MDLs. 

D. Random Unitary Matrices 

In this section, we provide a brief overview on random unitary matrices and discuss the isotropic invariance 



property that will prove useful when we compute the capacity of the fiber in Section III We define U (Af ) := 
{U e C*^^^^|U*U = UU* = Im} to be the space of M x M unitary matrices. If the distribution of an M x A^ 
random matrix is invariant to left (right) multiplication by any M x M (N x N) deterministic unitary matrix, it 
is called left (right) rotationally invariant. Assume the probability distribution function (pdf) / (A) of an ilf x A^ 
random matrix A exists, A is left rotationally invariant if 

/uA (UA) = / (A) (1) 

where U e U (M), and A is right rotationally invariant if 

/av (AV) = / (A) (2) 

where V e U (A^). A random matrix is isotropically invariant if it is left and right rotationally invariant. It turns out 
that U (M) forms a compact topological group, and thus a unique uniform measure (up to a scalar multiplication), 
called Haar measure, can be defined over U (M) ifTSl . llT6ll . Random unitary matrices of size M x M are random 
matrices sampled uniformly from U (M). 

Lemma 1.1: The pdf of A, an M x M random unitary matrix, is isotropically invariant lfT6l . 
This lemma will be helpful in the following section. 



II. Fundamentals and Modeling 

Coherent systems use well calibrated phase controlled laser sources and local oscillators operating well in the 
terahertz regime (hundreds of THz) to transmit and recover the phase and amplitude of an information-bearing 
signal. On the other hand, non-coherent systems use simple LEDs and photo detectors to transmit and detect the 
energy of an information signal. Thus, coherent systems are more complex, more expensive, and harder to build and 
maintain when compared to non-coherent systems. This is why the majority of currently deployed optical systems 
are non-coherent while a small percentage of the high end systems are coherent. However, coherent systems are 
becoming more popular as the optoelectronic devices are becoming more affordable. In fact, the state-of-the-art 
optical systems use both polarization and quadrature multiplexing to multiply the data rate by a factor of four For 
example, OC-768 systems use dual polarizations in addition to quadrature phase-shift keying (QPSK) to multiplex 
four independent data streams and transmit them all at the same time. The OC-768 network has transmission speeds 
of 40 Gbit/s. This means that in a system using QPSK and dual polarization, the transmitter operates at a frequency 
of about 10 GHz. Because coherent systems are becoming more popular and affordable, the capacity analysis we 
perform in Section [Hi] is exclusively applicable to coherent systems. 

Electromagnetic waves propagating inside the core of a fiber are characterized by Maxwell's equations. When 
the core radius is sufficiently small, only one solution to the wave equations is supported and the fiber is said to 
be a single-mode fiber. In multi-mode fiber systems, the core radius is relatively large and hence there is more 
than one solution (propagation mode) to the wave equation Q. Ideally, the field inside the core would propagate 
in different orthogonal modes that do no interact with one another. However, due to manufacturing non-idealities 
and index of refraction inhomogeneities, the modes may couple. This phenomenon is called mode coupling and is 
modeled in Section Hl-AI 

A. Fiber Propagation Model 

For coherent optical systems operating in the linear regime, the basic form of the baseband transfer function 
governing the input-output relationship of the i*'^ mode is given by 

H, {x,y, z,u:) = 4>; {x, y,uj) e-¥e"Jft("+"c). (3) 

where ujc is the laser's center frequency, 0^ {x,y,uj) is the transverse function (spatial pattern) of the z*'* mode, Ki 
is the mode-dependent attenuation factor, and /3i {uj + lJc) is the i*'^ mode's propagation constant ifTTll . Expanding 
the function /3i (w) around ujc using its Taylor series expansion, and keeping the first and second order derivative 
terms, we get 

where 0i {x, y) = 4>i {x, y, L, Uc), Qi = -kiL, 0i = j3i (w^) L, n = (3[ (uc) L, and at = (3'- (ojc) L. Observe that z 
has been suppressed as it has been evaluated at L, the fiber's length. The function (pi {x, y, oj) generally depends 
on u! but since the signal spectrum (tens of GHz) is narrow around the laser's center frequency (hundreds of THz), 



we drop this dependency and evaluate it at cj^- The model in (HI assumes that the propagation of the mode is 
completely characterized by a second order linear model where the only phenomena exhibited along the i*^ mode 
are 

• mode-dependent loss (MDL): gi — ^KiL 

• mode-dependent phase shift (MDPS): 9i = j3i (wc) L 

• group delay (GD): r^ — /3j (wc) L 

• group velocity dispersion (GVD): on = /3j {ujc) L 

The mode-dependent losses (MDLs) are negative quantities describing the attenuation experienced by the modal 
fields. On the other hand, the mode-dependent phase shifts (MDPSs) represent phase shifts experienced by the 
modal fields. In general, modal fields propagate at different speeds and thus the group delays (GDs) characterize 
the arrival times of different modes. Therefore, if we transmit a narrow pulse through the fiber, it would appear as 
a pulse having a width of Td — maxi.j{|ri — tj\] at the output of the fiber The quantity Td is referred to as the 
channel's delay spread. Assume, without loss of generality, that the group delays are sorted in increasing order, ti 
being the smallest and tm being the largest. In this case, Td is given by 



Td 



max{|Ti - Tj|} 



Tm - Ti 



L Pm (wc) - Pi (Wc) 



(5) 



Thus, Td is directly proportional to the length of the fiber. The pulse broadening phenomenon, due to nonzero 
Td, is called intermodal dispersion and is a serious performance limitation in MMF systems. The group velocity 
dispersion (GVD), also called chromatic dispersion (CD), suggests that different frequencies coupled to the same 
mode propagate at different speeds and hence broadening occurs to the field propagating in a particular mode. This 
phenomenon is called intra-modal dispersion. In a first-order model, intermodal dispersion is assumed to dominate 
over intra-modal dispersion and the GVD term is typically neglected, especially for shorter lengths L. Furthermore, 
since we are not interested in analyzing the field at every point (x, y) in the fiber's core, we suppress this term to 
obtain the following expression: 



Hi (w) ex e*e-^"'e-^"^*e-^"'"^ 



(6) 



Ideally, the field at the output due to the i*'' mode is given by Vi [t) = Si* hi (i), where Si {t) is the field at the 
input due to the same mode. Thus, the frequency domain vector representation of the modal fields at the output of 
the fiber is given by 



ri(w) 

TM (w) 



iJiH 



Hm (w) 



Si (cj) 
SM (w) 



(7) 
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Fig. 1: A multi-mode fiber with K propagation sections 



where the off-diagonal entries are zero because the modes are assumed to be orthogonal. This analysis neglects the 
existing fiber aberrations such as fiber bends, index of refraction inhomogeneities, and random vibrations, and is 
therefore incomplete. In fact, the modes interact with one another and exchange energy as they propagate along the 
fiber, compUcating the analysis of the wave propagation. The treatment we present was first applied to polarization 
mode dispersion (PMD) in ifTSl and was then generalized to model mode coupling in ifTSl . In the regime of high 
mode coupling, for example when plastic optical fibers are used, an MMF with M modesH is split into K ':$> 1 
statistically independent longitudinal sections as depicted in Figure [T] The number of sections K is equal to L/lc, 
where Ic represents the correlation length of the fiber The frequency response of each section is given by 

H'= (w) = U'^A'^ (w) V''^' for k^l,...,K (8) 

where U*^ and V'' are M x M frequency-independent projection matrices (unitary matrices) describing the modal 
coupling via a phase and energy shuffling process at the input and output of each section and 

a'' (w) = diag (e^Si-jel-ji^T^-ji^^al^^^^ gig^j-je^.-ii^r^j-jc^^Q^A ^g^ 

is the propagation matrix describing the ideal (uncoupled) field propagation in the A;*'' section. This model assumes 
that mode coupling occurs at the interface of different sections while the propagation in each section is ideal 
(and is described by A*^ (w)). In (Wll, the vectors g*^ = (g^, ...,5^^), O'' — (6'i, ..., 6*^^), r'^ — (rf , ...,t|j), and 
a'' — (^ai,...,a'l.i) represent the uncoupled MDL, MDPS, GD, and GVD coefficients in the fc*'' section. Here, 

g^ — —K^lc, 6^ = /3,f (ujc) Ic, Ti ~ Pi' i^c) Ic, and a^ = f3^ ^ (ujc) Ic are not necessarily identical across the M 



modes and K sections and will be modeled as random variables in Section II-B The overall channel frequency 



response is equal to the product of the frequency responses of the K sections and is given by 

H(w)=H(^)(a;)...H(i)(a;) (10) 

Alternatively, one could describe the input output relationship in time domain by 

H(t) - H(^) *H(^-i)...H(2) *H(i) (i) (11) 

' In this work, M refers to all the available spatial degrees of freedom including the x and y polarization states. 



In (111, the operation C (i) = A*B (t) represents a matrix convolution operation. Specifically, the {i,j) entry 



of C (t) is given by 



M 



Czj W = ^a./*^oW (12) 

1=1 

where M is the dimension of the square matrices A {t) and B (t) and • denotes the convolution operator. 

B. Random Propagation Model 

We now develop a random propagation model for the MIMO optical channel. The random model we introduce 
is an extended variant of what was presented in [13] and |19|. The per-section coupling matrices U'^' and V*^ are 
modeled as independent and identically distributed (i.i.d.) random unitary matrices with arbitrary distributions. We 
assume that the propagation characteristics gf, 6^, a^, and rf are all independent random quantities. In addition, 
each of g'^, , x'^, and ol^ has zero mean identically distributed, but possibility correlated, entries. The zero mean 
assumption is not restrictive because the mean MDL, MDPS, GD, and GVD do not affect the capacity of the fiber. 
Even though the propagation characteristics are identically distributed within a particular section, they need not have 
the same distributions from one section to the other. We define (jf^ to be the standard deviation of the uncoupled 
MDLs in the fc*'* section: o-fe = w Var (g^^) = ^cyVar (kJ^). At any fixed frequency wo, the overall frequency 



response in ( lOi can be written as 

H(c^o) = Uh(c^o)Ah(c^o)VJ,(cjo) (13) 



by the singular value decomposition (SVD) of H(a;o)- In (13 i, all the matrices are random frequency dependent 
square matrices and 



KH{iOn) = dmg[e-^P\...,e^P"'j (14) 

contains the end-to-end eigenmodes, singular values of H(a;o)- We note that the end-to-end eigenmodes are not 
actual solutions to the wave equation, but rather they characterize the effective overall propagation through the fiber 
The vector p = (pi , p2, •■•, Pm) contains the end-to-end mode-dependent losses, the logarithms of the eigenvalues of 
H(a;o)H* {ujq). These quantities are obviously frequency dependent random variables as they are the logarithms 
of the eigenvalues of a frequency dependent random matrix. The accumulated mode-dependent loss variance is 
defined as 

e^<yl + <yl + - + ^l (15) 

where ^ is measured in units of the logarithm of power gain and can be converted to decibels by multiplying its 



value by 10/ In 10 ||T3| . When all sections have identical distributions for the MDLs, Equation (15i reduces to 
^^ — Ka^ because ak — cr for all k. 



III. Capacity of Multi-Mode Fibers 

In this section, we compute the capacity of coherent MMF systems under the presence of mode-dependent phase 
shifts (MDPSs), mode-dependent losses (MDLs), group delay (GD), chromatic dispersion (CD), and mode coupling. 
Table [I] summarizes the parameters governing the random propagation model presented in Section II-B Each of the 



TABLE I: Random propagation model 



fiber's frequency response 


H(a;)=H^ {u) ...W (uj) 


per-section response 


n" {uj) = u'^'A* (w) v*^- 


per-section coupling matrices 


U*-' and V-' 


uncoupled MDL 


g'^' = {9i,--,9m) 


uncoupled MDPS 


e'' = {9t...,9'i,) 


uncoupled GD 




uncoupled GVD 


a'' = {a1,...,alj) 


uncoupled MDL variance 


a^=Var(5f)=Z^VarK) 


accumulated MDL variance 


C^ =af + ai + ... + aj^ 



vectors g'^, , r'^, and cxf' has zero mean identically distributed, but possibility correlated, entries. Moreover, the 
vectors g*^'!, 9 S t''"i, and a'^i are independent of g'^^, ^, r*^^, and a''"2 for fci ^ k2- However, they can have 

that the fc*'' section propagation matrix is given by 



II-B 



■ nk ■ k ■ 2 k 1 k -/ifc - k ■2k 



the same statistical distributions. Recall, from Section 

a'' {uj) = diag (e^si 

where 6'= = diag fe-^'^i , ..., e-^^'f V T'^ = diag fg-J"^", ..., e-^'^'^^X A'' = diag ( 
G'^ = diag (e^ffi , ...,e39M j. 

A. Frequency Flat Channel Capacity 

We first study the capacity of the system when the channel's delay spread and CD are negligible. The more 



(16) 



diag e-J'^ "i,...,e-J'^ "^O, and 



general frequency selective case is handled in Section III-B In this regime, max^j |r/° — t*j\ « and max^ |aj^| ~ 



1 •••7 ^ 



.,e 



1 fc ■a'^ 

2 yA/ J'^M 



and hence t^ = t'' and aj^ — for all i and k. Therefore, the fc*'' section propagation matrix is given by 

a'' (w) = diag [e5« 

= e-^"^'diag(e59i-J«i, 

where A*^' = G'^'G'''. Therefore, the overall response can be written as 

HH = e-^"5:Li-'"u^A^V^^"...UiAiVi* 



(17) 



(18) 



10 



where XJh, ^h^ and VJ^ are obtained by applying the singular value decomposition to U^A^V^ ...U^A^V^ . 
Observe that XJh, ^h, and VJ^ are all frequency independent. The term e~^"^'==i ^ is a delay term and can be 
neglected if we assume that the transmitter and receiver are synchronized. Thus, the channel is frequency flat and 
is given by 

H = IJhAhY*h (19) 



Consequently, the input-output relationship under this frequency flat channel model in (19 1 is given by 



y = Hx + V (20) 

where x and y represent the transmitted and received vectors, respectively, and v represents the modal noise which 
is modeled as additive white Gaussian noise (AWGN) with covariance matrix NqI-m, Nq being the noise power 
density per Hz. This assumes that coherent optical communication is used and that electronic noise is the dominant 
source of noise. In addition, the fiber non-linearities are neglected under the assumption that the signal's peak to 
average power ratio (PAPR) and peak power are both low enough. This condition is not restrictive because MMFs 
have large radii and hence can support more power (relative to single mode fibers) while still operating in the linear 
region. The input-output model in ( [20| may seem identical to the wireless MIMO flat fading one. However, the 
Rayleigh fading i.i.d. model does not hold in our case because H is a product of K terms, each containing a random 
diagonal matrix sandwiched between two random unitary matrices. Moreover, the entries of H are correlated. From 
EJ, the capacity of a single instantiation of the channel in ( 19 1, when channel state information (CSI) is not available 



at the transmitter, is given by 



C(H) = logdetl'lM + ^Hff 
^^ ' SNR , 



where SNR = P/AqW^, P representing the total power divided equally across all modes and W representing 
the available bandwidth in Hz. The A^'s are the eigenvalues of HH*. If CSI is available at the transmitter, the 
capacity could be further increased through waterfilhng EOl . lITOl . In this case, the transmitter pre-processes the 
transmit vector x by allocating powers using a waterfilling procedure and then multiplies x by V//. On the other 
side, the receiver multiplies the received vector y by U|^. This effectively turns the MIMO channel into a set of 
parallel AWGN channels. In optical communications, the beamforming process assumes that the designer can couple 
the fields of different sources onto the fiber exactly as determined by N h. This procedure, though beneficial, is 
complicated as it necessitates the design of sophisticated reconfigurable mode-selective spatial filters using coherent 
spatial light modulators ETI . Il22l . 



In the above analysis, we considered the capacity of (20i for a given instantiation of H. However, since H is 



random, the channel capacity C (H) is a random variable. In the fast fading regime, the ergodic capacity, expected 
value of C (H), is desired as it dictates the fastest rate of transmission [lO]. On the other hand, in the slow fading 
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Fig. 2: Distribution of end-to-end MDL 
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regime, the cumulative distribution function (CDF) of C (H) is desired as it determines the probability of an outage 
event for a particular rate of transmission ifTOl . In either case, the cumulative distribution and the expected value of 



C (H) are both functions of the distribution of A = (A^, ..., X\.i), the eigenvalues of HH*. From ( 19 1, the matrix 



HH* = U//A|jUJf is Hermitian and its eigenvalues, the squares of the singular values of H, are real non-negative 
quantities. Recall from Section 



II-B 



that the quantity A„ = e'''" refers to the n*'* end-to-end eigenmode and the 
quantity p„ refers to the n*'' end-to-end mode dependent loss (MDL). The distribution of the end-to-end MDL 
values was studied in ifTSl where it was shown that as M tends to infinity, the p„'s become independent and 
identically distributed on a semicircle. Their analysis and simulations, however, did not incorporate the effect of 
mode dependent phase shifts (MDPSs), 6'*^'s. We now show that the statistical distribution of the end-to-end MDL 
values is unchanged even when MDPSs are incorporated. 

Theorem 3.1: The statistics of H are independent of mode dependent phase shifts. 

Proof: We show that the statistics of H'"' = U''"6'''G'''V'^* are the same as those of H''' = U'^'C'^'V'''* for all 
k — 1, ...,K. Observe that O'^ is a unitary matrix that is also random because it has random orthonormal columns. 
However, O'^ does not necessarily belong to the class of random unitary matrices as it is not necessarily uniformly 
distributed over U(M). Nonetheless, we note that the distribution of W = U'^'O'^' is the same as the distribution 
of U'^ because 



/(w) = / /(w|e'=)/(e'^)de^- 

Je'' 






/ (e'') dQ'' 



(22) 



where the second equality holds because for a given instantiation of O'^, the random matrix W|0'' has the same 
distribution as U'^ (by Lemma \1A\ in Section I-Di. Therefore, the statistics of H are unchanged when the MDPSs 
are incorporated, and thus the results in lil3J carry over to this more general setting. ■ 

Figure |2] shows that for AI ~ 100 the distribution of p„ approaches a semicircle. The distributions in Figure |2] were 



obtained by generating a large sample of channel matrices H (for M = 8, M = 52, and 100) and estimating the 
distributions of the logarithm of their singular values. Appendix IA] explains how we can generate random unitary 
matrices, which are needed to create samples of H, from matrices with i.i.d. complex Gaussian entries. In this 
section, we use the following notation 

where x denotes the energy-normalized version of the random variable x. The average capacity of C (H) is given 
by 



M 



L'avg — / ^ ^ 



log ( 1 + -j^K 



b/s/Hz 



(24) 



where the average is taken over the statistics of the end-to-end MDL values ifTol . Figure 3a shows the average 
capacity of MMFs for various values of M and ^ = 4 dB. The capacity of the system increases with an increasing 
number of modes. This is intuitive because as the number of modes increases, the fiber's spatial degrees of freedom 
are increased. Figure [3b] shows the effect of accumulated MDLs on the average capacity. An increasing value of 
^ results in a capacity equivalent to that of a fiber with fewer modes. This means that as ^^, the accumulated 
mode-dependent loss variance, increases the system loses its spatial degrees of freedom. 
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(a) Capacity of MIMO MMF systems at ^ = 4 dB 




(b) Effect of MDL for M=100 



Fig. 3: Ergodic Capacity Anlysis 



B. Frequency Selective Channel Capacity 

When chromatic and intermodal dispersion are taken into account, the fiber's frequency response H {uS) becomes 
frequency selective. Under the same linear assumptions as in the previous section, the input-output relationship is 
given by 

y(i)=H(i)*x(t)+v(t) (25) 



where v (t) is a vector Gaussian process, x (t) is the input, and y (i) is the received signal. Recall, from Section 



H-A that H (wq) can be written as 



H (wo) = Uh (t^o) Aff (cjo) Vh (^o) 



(26) 



where Uh ('^o). ^h ('^o). and VJ^ (wq) all depend on wq. From ifTol . the capacity of a single instantiation of 
H (w), when CSI is not available at the transmitter, is equal to 



C 



1 



2ttW 



27tW 



logdet In 



SNR 



H (uj) H* (w) dw b/s/Hz 



(27) 



where W is the bandwidth of the system in Hz and SNR = P/NqW ifTol . This capacity can be achieved by 
Orthogonal Frequency Division Multiplexing (OFDM) with N sub-carriers (as N tends to infinity). MIMO OFDM 
modulation is a popular modulation scheme in wireless communications and is currently being developed for the next 
generation optical systems ll23l . The maximum achievable capacity of a MIMO-OFDM system with TV sub-carriers 
is 



C = 



N 



-^logdetflN 

^ N M 



SNR 



M 
SNR 



H,H*, 



\2 

M "'' 



b/s/Hz 



(28) 



i — l n—1 

where H^ = H (cj^) and A^^ ^ is the n*'' eigenvalue of HiH*^. When CSI is available at the transmitter, waterfilling 
can be performed to allocate optimal powers across sub-carriers and transmitters. 



In the above analysis, we considered the capacity of (25 1 for a given instantiation of H(a;). In our work, we 



focus on analyzing the expected capacity of the frequency selective system which is given by 

N M 

N ■ 



a 



E 



N M 



1 



SNR- 2 



M 



b/s/Hz 



(29) 



i—l n—1 

To begin with, if we assume that, in each section, all modes experience the same random loss (i.e., the entries of g*^ 
are perfectly correlated), then G'' = 659 Ij^. Furthermore, assume that the K sections are statistically identical. 
Therefore, <Jk = o for all fc and i^ = Ka^ . In this case, the overall response is given by 

H(a;) = H^' (cj) ...H^ (cj) 

= e5nii9''u^e^T^A^V^"...UieiTiAiVi' (30) 

Observe that, even though H (a;) is a function of cj, H (a;) H (w)* = e^h=\ 9 \^^ jg independent of uj. This means 
that A^ j = A = ei^k=i 9 j^ independent of the frequency index i and the mode number n. Thus, the average 
capacity of the fiber is given by 



Ca 



ME 



log ( 1 + -^A^ 



(31) 
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where the average is taken over the statistics of A^. Observe that the average capacity scales linearly with M, the 
number of modes, and thus the fiber has M degrees of freedom. Therefore, neither group delay nor chromatic 
dispersion affect the average capacity of the fiber. 

We now derive the capacity for the general case (i.e., when the entries of g'' are potentially independent). The 
following theorem shows that the statistics of H (a;) are independent of lo. 

Theorem 3.2: The statistics of H (w) are independent of ui. 



Proof: Using the same technique as in the proof of Theorem 3.1 we can show that the statistics of H''' (cj) = 
jjkQkrYkji^kQk-yk* ^j.g jj^g g^j^^g ^^ jj^g gj-^tistics of H''' = U'^G'^V'^' by showing that the distribution of W'^ = 
UfcQferp/c^fe jg equal to the distribution of U*^. Thus, the statistics of H'^ (cj) are independent of cj. ■ 

This result shows that the statistics of the eigenvalues of H^H* are identical for all i. Therefore, the average 
capacity expression can now be rewritten as 

M 

b/s/Hz (32) 



Cavg^^IE 



log ( 1 + -j^>^l 



n=l 

which is identical to the average capacity of frequency flat optical MIMO systems. Therefore, the results of the 
previous section carry over to the frequency selective case. 

IV. Input-Output Coupling Strategies 

The capacity analysis presented in Section [III] is important, but it only serves as an upper limit on the achievable 
rate. This limit can only be achieved by making use of all available spatial modes. In theory, one can always design 
a fiber with a sufficiently small core radius such that a desired number of modes propagate through the fiber ||6l. 
In reality, one has to rely on currently installed optical fibers and available technologies. The state-of-the art OM3 
and OM4 MMF technologies have core radii of 50 /im with hundreds of propagation modes. Unfortunately, having 
a 100 X 100 MIMO system is neither physically nor computationally realizable at the moment. This means that a 
more careful look at the effective channel capacity has to be considered. This is why we now focus on the case 
when Nt transmit laser sources and Nr receivers are used. For most of this section, we assume that intermodal 
and chromatic dispersions are negligible. Even though this may seem like a restriction, this assumption serves to 
simplify the discussion and presentation of input-output coupling strategies. The results and procedures we present 
offer insight and can be extended to the more general frequency selective case. 

A. Input-Output Coupling Model 

The input coupling is described by Cj, an M x Nt matrix, and the output coupling is described by Co, an 
Nr X M matrix. Here, M is much larger than Nt and Nj. and the overall response is given by 

Ht = CoHW...H(i)Ci 

= CoHCi (33) 



Therefore, for a single instantiation of Hj, the capacity of the channel is given by 

C (Ht) = logdet ("in, + ^HtH^ (34) 

The input-output coupling coefficients (entries of Ci and Co) are complex quantities capturing the effect of both 
power and phase coupling into and out of the fiber These coefficients are determined by the system geometry 
and launch conditions. For example, in order to study the input coupling profile of each light source one needs to 
specify its exact geometry and launching angle, and then solve the overlap integrals: two dimensional inner products 
between the laser's spatial patterns and those of each mode 

0j {x, y) (ps^ {x, y) dxdy (35) 

where Cy is the (i,j) entry of Ci, 0^ {x,y) is the i*'' mode spatial pattern, and 0^. {x,y) is the f*^ laser source 
spatial pattern. However, this procedure is cumbersome and offers little insight on the underlying channel physics. 
In what follows, we provide a simple condition on the input-output couplers. This condition will prove useful when 



we present an input-output scheme that maximizes the achievable rate of the overall system (Section IV-B I and 



impose a statistical model for Ci and Co (Section IV-C i 



Theorem 4.1: If we neglect the power lost due to input coupling inefficiencies, then a necessary and sufficient 
condition for Ci to be an input coupling matrix is given by 

(ci,Cj) = (5„- (36) 

where ci represents the «*'' column of Cj and (a, b) denotes the standard Euclidean inner product between the 
vectors a and b. This means that the columns of Cj should form a complete orthonormal basis for C^'. Similarly, 
if we neglect the power lost due to output coupling inefficiencies, then the rows of Cq should form a complete 
orthonormal basis for C^''. 

Proof: Satisfying the energy conservation principle requires that 

||Cix|p = ||x|p VxeC^' (37) 

This means that the energy of the input vector should be equal to the energy of the mode vector at the input of the 
fiber This condition holds whenever the mapping Ci is a linear isometry mapping. In the special case where Cj 
is a square matrix, a classical result in linear algebra states that Cj has to be a unitary matrix [241. However, Cj 
is a tall M x Nt (M ^ Nt) rectangular matrix. In this case, the condition in ( [37] i can be rewritten as 

(Cix,Cix) = (x,x) VxeC^' (38) 

or equivalently as 

(x, [Ci*Ci-InJx) = VxeC^* (39) 
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If Cj Ci = Inj , the condition in ( 39 1 holds and Cj preserves the norm. This choice ensures that the columns of 



Ci form a complete orthonormal basis for C^'. However, this only proves the sufficiency part of the theorem. To 



prove the necessity part, we consider B = CJCi — iNt and show that if (39 1 holds, then it is equal to zero. It 
can be easily verified that if B is a diagonal matrix, then (x, Bx) = Vx G C^* implies that B = 0. The same 
observation holds if B is diagonalizable. In this case, one can choose an orthonormal basis of eigenvectors and map 
(x, Bx) = Vx G C'^* to (x, Dx) = Vx G C^* where D is a diagonal matrix. In our case, B is Hermitian and 
hence it is unitarily diagonalizable so that B = or alternatively, CjCi = iNt as desired. A similar proof can be 
carried out to show that CqCq = In^. ■ 

We note that even though CjCj = In^, CiCj ^ iNt because Cj is of full column rank, but not of full row rank. 
We now use this property to show that Cj and Co should have a special structure. 
Theorem 4.2: The input-output coupling matrices Cj and Co can be expressed as 



Ci = Ui 



0(M-Nt)xNt 



Vi* (40) 



Co = Uo [In.On,x(m-n.)] V^ (41) 

where Ui and Vq are M x M unitary matrices, VJ is an Nt x Nt unitary matrix, and Uo is an Nr x N^ unitary 
matrix. 

Proof: By the singular value decomposition (SVD), Ci = UjAiVj and Co = UoAoVq ||24l. The non-zero 
singular values of Ci are the square roots of the eigenvalues of CjCi, and CjCi — iNt- A similar argument holds 
for Co. ■ 

B. Input-Output Coupling Strategies 

In this section, we assume that Nt = Nr- When CSI is available at the transmitter and the design of Ci and Co 
is affordable, a desirable choice for the input-output couplers is the one that maximizes the system's capacity 

SNR 
(Ci.Co) " V "" ^t 

where the A^^'s are the eigenvalues of HtH^. We note that Cj'' and Cq^ should have a structure compliant with 
( |40l i and ( |4T] i, respectively. Instead of solving the above constrained optimization problem, we provide an intuitive 
choice for (Ci,Co) and argue that it leads to a maximized overall capacity through simulations. 



(CiP\C°P*) = arg max logdet ( In, + ^r^HtH^ 
V ^ ^ J (Ci.Co V ' TV* 



Proposition 4.1: The capacity of the overall system in (34i is independent of the choice of Vj and Uo from 
^ and (|4T). 



Proof: The capacity of the overall system is given by 



C(Ht) - logdet In, 



logdet In, 



= log det iNt 



log det iNt 



SNR 

SNR 
SNR 

SNR 

lv7 



HtHt 






UoAoVJ)HUiAiAi*Ui*H*VoAJ)U^ 



AoVJ)HUiAiAi*Ui*H*VoA^ 



(43) 



Therefore, the capacity of the overall system is independent of VJ and Uq and hence, without loss of generality, 
we will assume that they are both equal to the identity matrix. ■ 

The following input-output coupling scheme is suggested 



Ci=Vh 



I 



Nt 







(M-Nt)xNt 



Co — flN„Onvf- 



O — [-lNrl^(M-Nt)xNtJ"^H 



\J* 



(44) 
(45) 



where Vh and Uh have been defined in ( 19 1. Choosing Vq = Uh, Uq = In^ Ui = Vh, and Vj = In^ leads 
to an overall response given by 



Ht — [InO(m-n)xn] Ah 
= diag(Ai,...,A7v) 



In 

0(M-N)xN 



(46) 



Thus, the overall MIMO channel is transformed into a set of parallel AWGN channels. Moreover, since the SVD 



in (19 1 sorts the singular values in decreasing order, the signal energy has been restricted to the Nt (out of M) 
least lossy end-to-end eigenmodes. The capacity achieved by this choice of input-output coupling is 



Nt 



C(H,)-^log 1 



SNR 



b/s/Hz 



(47) 



This capacity could be further increased by pre-processing x via a diagonal power allocation matrix K using 
waterfilling. Our strategy is intuitive since we only have Nt degrees of freedom so it would be wise if we use the 
Nt least lossy end-to-end eigenmodes to transmit. We note that even though the effective end-to-end fiber response 
shows that we have used the Nt best end-to-end eigenmodes only, Nt signals were coupled to and collected from all 



the available physical modes at the input and output of the fiber. Nonetheless, achieving ( 47 1 requires, as discussed 
before, having CSI at transmitter and using adaptive spatial filters which is typically hard to implement. Even though 
we did not prove that the above strategy is capacity optimal, our simulations section will show that it appears to 
maximize the capacity of the overall system. 



C. Random Input-Output Coupling 

The design of reconfigurable input-output couplers is expensive and assumes the availabihty of CSI at the 
transmitter (which is only feasible when the channel is varying slowly). More importantly, in many cases, the 
coupling coefficients are affected by continuous vibrations and system disturbances. Thus, full control over Cj and 
Co is not always affordable. In this section, we analyze the capacity when the user does not have control over 
Ci and Cq. This will give us better insight on the achievable capacity of MIMO MMF systems. We model the 
coupling coefficients as time varying random variables and impose a physically inspired distribution that respects 
both the fundamental energy preservation constraint and the maximum entropy principle. Even though we focus on 
describing the statistical model of Cj, our discussion applies equally well for Cq. 

For an M x M square matrix A, the energy conservation principle confirms that A should belong to U [M). 
It was proven in ifTSl that since a Haar measure exists over U (Af ), one could define a uniform distribution over 
U [M). Therefore, we choose a random setting where the input coupling matrix Cj has its Nt columns randomly 
selected from a square matrix A that is uniformly distributed over U {M). This distribution ensures that the columns 
of Ci form a complete orthonormal basis for C^* and gives equal probability measure for all such possible vectors. 
In other words, Ci is uniformly distributed over a Stiefel manifold VjVj (C^^). Appendix [a1 shows how we can 
generate Ci and Co from an M x M matrix with i.i.d. Gaussian entries. The ergodic capacity of the overall 
system can now be computed by averaging over the statistics of the input-output couplers and the statistics of the 
fiber response. Similarly, one could also compute the probability of an outage event by obtaining the cumulative 
distribution function (CDF) of the capacity, which now depends on the statistics of Cj and Co- 

D. Discussion 

We have evaluated the capacity of both controlled and uncontrolled MIMO MMF systems. As discussed in 
Section |IV-B[ the controlled case refers to the case when CSI is available at the transmitter side and there is full 
control over the input-output couplers. The uncontrolled case refers to the random coupling model presented in 



Section IV-C In our simulations, we numerically computed the ergodic capacity using (24 1, with H replaced by 



Ht = CqHCi. The average in (24 1 is taken over the statistics of the channel and the input-output couplers for 
the uncontrolled case. For reference, we included plots of the capacity when 

• all mode dependent losses are equal to zero and there is no mode coupling {K ~ 1 and ^ — 0); hence the 
channel has unity eigenmodes. A fiber with such properties will be referred to as an ideal fiber. 

• the fiber core radius is chosen so that only Nt modes can propagate. In this case the input and output coupling 
matrices are unitary matrices. 



In this analysis, we consider Nt = Nr = 4, K = 256, £, = 4: dB, and M = 100. Comparing Figures l4] and 3b 



we observe that the capacity of a 4 x 4 system over a 100-mode fiber is inferior to the intrinsic capacity of the 
fiber (i.e., when all the modes are used). This result is expected since we are using 4 out of 100 available degrees 
of freedom. At moderate SNR values the loss in capacity is about 6 dB. On the other hand, observe, from Figure 
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Fig. 4: Achievable capacity of a 4 x 4 MIMO MMF system 



|4] that the performance of an uncontrolled 4x4 system over a 100-mode fiber is close to that of a system with 
4 modes. Thus, cuiTently installed fibers could be used without significant loss in capacity. We also note that, by 



using the input-output coupling strategy presented in Section IV-B performance equal to that of an ideal fiber can 
be achieved. This is explained by revisiting Figure [2] which shows the probability distribution of the end-to-end 
MDL values when M — 100. We observe that in this case we only use the best 4 eigenmodes to transmit the 
signal. As such, it is highly probable that these 4 (out of 100) modes will have close to zero end-to-end mode- 
dependent losses, and thus the performance is almost equal to that of an ideal fiber (even when ^ is large). The 
larger M is, the closer the capacity of a controlled fiber can get to that of an ideal fiber. Finally, one could argue 
that coupling a reasonable number of inputs to a fiber with hundreds of modes is advantageous since the fiber's 
peak power constraint is proportional to the number of modes (recall that more propagation modes means larger 
core radius). This means that compared to an A^t-mode fiber, a higher capacity could be achieved if we signal over 
an M ^ A'i-mode fiber since the total power budget can now be increased. 



V. Conclusion 

MIMO communications over optical fibers is an attractive solution to the ever increasing demand for Internet 
bandwidth. We presented a propagation model that takes input-output coupling into account for MIMO MMF 
systems. A coupling strategy was suggested and simulations showed that the capacity of an Nt x Nt MIMO system 
over a fiber with M S> Nt modes can approach the capacity of an ideal fiber with Nt modes. A random input-output 
coupling model was used to describe the behavior of the system when the design of the input-output couplers is 
not available. The results illustrated that, under random coupling, the capacity of an Nt x Nt MIMO system over 
a fiber with A/ ^ Nt modes is almost equal to that of an A^j-mode fiber. 
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Appendix 

Our method for generating random unitary matrices is based on the QR decomposition procedure ifTSl . In this 
case, A is constructed as follows: 

1) Generate an M x M matrix Z with i.i.d. complex Gaussian entries. 

2) Obtain the QR decomposition of Z; Z = QR. 



3) Form the following diagonal matrix: 



/ '•11 
kill 



'•22 
I '•22 I 



(48) 



\ \rMM\ / 

where {rii]fLi are the diagonal entries of R. 
4) Let A = AQ. 
In the above construction, A is obviously unitary since Q is unitary. Furthermore, it can be shown that A has a 
uniform distribution over U (ill). To generate the input coupling matrix, the following method is used: 

1) Generate an M x M unitary matrix A (as described above). 

2) Choose Nt columns randomly from A to form Cj. 

A similar approach can be taken to generate Cq. In this case, Nj. columns are selected randomly from A to 
represent the rows of Cq. 
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